As improvements in processor performance continue to far outpace improvements in 
storage performance, I/O is increasingly the bottleneck in computer systems, especially in 
large database systems that manage huge amounts of data. The key to achieving good 
I/O performance is to thoroughly understand its characteristics. In this article we present a 
comprehensive analysis of the logical I/O reference behavior of the peak 
production database workloads from ten of the world's largest corporatio ... 

Keywords: I/O, TPC benchmarks, caching, locality, prefetching, production database 
workloads, reference behavior, sequentiality, workload characterization 
July 1990 Proceedings of the second international symposium on Databases in parallel and distributed systems 
Publisher: ACM Press 

With current systems, some important complex queries may take days to complete 
because of (1) the volume of data to be processed and (2) limited aggregate resources. 
Introducing parallelism addresses the first problem. Cheaper but powerful computing 
resources solve the second problem. According to a survey by Brodie,¹ only 10% of 
computerized data is in data bases. This is an argument for both more variety and volume 
of data to be moved into data base systems. We conject ... 